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X. 

^ . Abstract 

We compare different procedures for combining fixed-order tree-level matrix- 
element generators with parton showers. We use the case of W-production at the 
Tevatron and the LHC to compare different implementations of the so-called CKKW 
and MLM schemes using different matrix-element generators and different parton 
cascades. We find that although similar results are obtained in all cases, there are 
important differences. 
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1 Introduction 



One of the most striking features of LHC final states will be the large number of events with 
several hard jets. Final states with 6 jets from tt decays will have a rate of almost 1 Hz, 
with 10-100 times more coming from prompt QCD processes. The immense amount of 
available phase space, and the large acceptance of the detectors, with calorimeters covering 
a region of almost 10 units of pseudo-rapidity (77), will lead to production and identification 
of final states with 10 or more jets. These events will hide or strongly modify all possible 
signals of new physics, which involve the chain decay of heavy coloured particles, such as 
squarks, gluinos or the heavier partners of the top, which appear in little-Higgs models. 
Being able to predict their features is therefore essential. 

To achieve this, our calculations need to describe as accurately as possible both the 
full matrix elements for the underlying hard processes, as well as the subsequent devel- 
opment of the hard partons into jets of hadrons. However, for the complex final-state 
topologies we are interested in, no factorization theorem exists to rigorously separate 
these two components. The main obstacle is the existence of several hard scales, like the 
jet transverse energies and di-jet invariant masses, which for a generic multi-jet event will 
span a wide range. This makes it difficult to unambiguously separate the components 
of the event, which belong to the "hard process" (to be calculated using a multi-parton 
amplitude) from those developing during its evolution (described by the parton shower). 
A given {n + l)-jet event can be obtained in two ways: from the coUinear /soft-radiation 
evolution of an appropriate {n + l)-parton final state, or from an n-parton configuration 
where hard, large-angle emission during its evolution leads to the extra jet. A factoriza- 
tion prescription (in this context this is often called a "matching scheme" or "merging 
scheme" ) defines, on an event-by-event basis, which of the two paths should be followed. 
The primary goal of a merging scheme is therefore to avoid double counting (by prevent- 
ing some events to appear twice, once for each path), as well as dead regions (by ensuring 
that each configuration is generated by at least one of the allowed paths). Furthermore, a 
good merging scheme will optimize the choice of the path, using the one, which guarantees 
the best possible approximation to a given kinematics. It is possible to consider therefore 
different merging schemes, all avoiding the double counting and dead regions, but leading 
to different results in view of the different ways the calculation is distributed between the 
matrix element and the shower evolution. As in any factorization scheme, the physics 
is independent of the separation between phases only if we have complete control over 
the perturbative expansion. Otherwise a residual scheme-dependence is left. Exploring 
different merging schemes is therefore crucial to assess the systematic uncertainties of 
multi-jet calculations. 

In this work we present a comprehensive comparison, for W plus multijet produc- 
tion, of three merging approaches: the CKKW scheme, the Lonnblad scheme, and the 
MLM scheme. Our investigation is an evolution and extension of the work in [1], where 
Mrenna and Richardson presented implementations of CKKW for HerwiG and the so- 
called pseudo-shower alternative to CKKW using Pythia, as well as the results of an 
approach inspired by the MLM-scheme. Our work considers the predictions of five dif- 
ferent codes, Alpgen, Ariadne, Helac, MadEvent and Sherpa. Alpgen implements 
the MLM scheme, and the results shown here are obtained with the HerwiG shower; 
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Ariadne the Lonnblad scheme; Helac the MLM scheme, but will show results with the 
Pythia shower; MadEvent uses a variant of the MLM scheme, based on the CKKW 
parametrization of the multiparton phase-space; Sherpa, finally, implements the CKKW 
scheme. This list of codes therefore covers a broad spectrum of alternative approaches 
and, in particular, includes all the programs used as reference event generators for multijet 
production by the Tevatron and LHC experimental collaborations; for those, we show re- 
sults relative to publically available versions, therefore providing valuable information on 
the systematics involved in the generation of multijet configurations by the experiments. 
A preliminary study, limited to the Alpgen, Ariadne and Sherpa codes, was presented 
in [2]. 

While [1] devoted a large effort to discussing the internal consistency and validation of 
the meging schemes, we refer for these more technical aspects to the papers documenting 
the individual implementations of the meging algorithms in the codes we use [3-7], and we 
shall limit ourselves here to a short review of each implementation. We concentrate instead 
on comparisons among physical observables, such as cross sections or jet distributions, 
which we study for both the Tevatron and the LHC. The main goal is not an anatomy of 
the origin of possible differences, but rather the illustration of their features and their size, 
to provide the experimentalists with a quantitative picture of systematics associated to the 
use of these codes. We furthermore verify that, with only a few noteworthy exceptions, the 
differences among the results of the various codes are comparable in size with the intrinsic 
systematics of each approach, and therefore consistent with a leading-logarithmic level of 
accuracy. The quantaties we present correspond to experimental observables and the 
differences between the predictions of the various codes that we present could therefore 
be resolved by comparing with data. 

We begin the paper with a short review of the merging prescriptions and of their 
implementations in the 5 codes. We then introduce the observables considered for this 
study, and present detailed numerical results for both the Tevatron and the LHC. We 
then provide with an assessment of the individual systematics of each code, and a general 
discusison of our findings. 

2 Merging procedures 

In general, the different merging procedures follow a similar strategy: 

1. A jet measure is defined and all relevant cross sections including jets are calcu- 
lated for the process under consideration. I.e. for the production of a final state 
X in pp-coUisions, the cross sections for the processes pp — > X + n jets with 
n = 0, 1, . . . , = rimax are evaluated. 

2. Hard parton samples are produced with a probability proportional to the respective 
total cross section, in a corresponding kinematic configuration following the matrix 
element. 

3. The individual configurations are accepted or rejected with a dynamical, kinematics- 
dependent probability that includes both effects of running coupling constants and 
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of Sudakov form factors. In case the event is rejected, step 2 is repeated, i.e. a new 
parton sample is selected, possibly with a new number of jets. 

4. The parton shower is invoked with suitable initial conditions for each of the legs. In 
some cases, like, e.g. in the MLM procedure described below, this step is performed 
together with the step before, i.e. the acceptance/rejection of the jet configuration. 
In all cases the parton shower is constrained not to produce any extra jet; stated in 
other words: configurations that would fall into the realm of matrix elements with 
a higher jet multiplicity are vetoed in the parton shower step. 

The merging procedures discussed below differ mainly 

• in the jet definition used in the matrix elements; 

• in the way the acceptance/rejection of jet configurations stemming from the matrix 
element is performed; 

• and in details concerning the starting conditions of and the jet vetoing inside the 
parton showering. 

2.1 CKKW 

The merging prescription proposed in [8,9] is known as the CKKW scheme and has been 
implemented in the event generator Sherpa [10] in full generality [11]. 
In this scheme 

• the separation of the matrix-element and parton-shower domains for different multi- 
jet processes is achieved through a /cj_-measure [12-14], where k±o denotes the in- 
ternal separation cut, also called the merging scale; 

• the acceptance/rejection of jet configurations proceeds through a reweighting of the 
matrix elements with analytical Sudakov form factors and factors due to different 
scales in a^; 

• the starting scale for the parton shower evolution of each parton is given by the 
scale where it appeared first; 

• a vetoed parton-shower algorithm is used to guarantee that no unwanted hard jets 
are produced during jet evolution. 

In the original paper dealing with e~^e~ annihilations into hadrons, [8], it has been 
shown explicitly that in this approach the dependence on k±Q cancels to NLL accuracy. 
This can be achieved by combining the Sudakov-reweigthed matrix elements with a vetoed 
parton shower with angular ordering, subjected to appropriate starting conditions. The 
algorithm for the case of hadron-hadron collisions has been constructed in analogy to the 
e~^e~ case. However, it should be stressed that it has not been shown that the CKKW 
algorithm is correct at any logarithmic order in this kind of process. 
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For hadron-hadron collisions, the internal jet identification of the SHERPA-merging 
approach proceeds through a k±-scheme, which defines two final-state particles to belong 
to two different jets, if their relative transverse momentum squared 



k'iij = 2min{p^i, 



[cosh(?7^^ - ?7j) - cos{(j)i - 



(1) 



is larger than the critical value k'^Q. In addition, the transverse momentum of each jet 
has to be larger than the merging scale k±Q. The magnitude D, which is of order 1, is a 
parameter of the jet algorithm [15]. In order to completely rely on matrix elements for jet 
production allowed by the external analysis, the internal D should be chosen less than or 
equal to the D-parameter or, in case of a cone-jet algorithm, the i?-parameter employed 
by the external analysis. 

The weight attached to the generated matrix elements consists of two components, 
a strong-coupling weight and an analytical Sudakov form-factor weight. For their deter- 
mination, a k±-iet clustering algorithm guided by only physically allowed parton com- 
binations is applied on the initial matrix-element configurations. The identified nodal 
k±-valnes are taken as scales in the strong-coupling constants and replace the predefined 
choice in the initial generation. The Sudakov weight attached to the matrix elements ac- 
counts for having no further radiation resolveable at k^Q. The NLL-Sudakov form factors 
employed, cf. [12], are defined by 



^q{Q,Qo) = exp l~ J dqTg{Q,q)\ , 
[ Qo } 
( Q 

A,(Q,go) = expl- J dq[Tg{Q,q)+Tf{q)] 



where Fg are the integrated splitting functions q ^ qg, g ^ gg and g 
given through 
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qq, which are 

(3) 
(4) 
(5) 



They contain the running coupling constant and the two leading, logarithmically enhanced 
terms in the limit go ^ g. The single logarithmic terms —3/4 and —11/12 may spoil an 
interpretation of the NLL-Sudakov form factor as a non-branching probability. Therefore, 
F(g, q) is cut off at zero, such that Aq^g{Q, Qq) retains its property to define the probability 
for having no emission resolvable at scale go during the evolution from g to go. These 
factors are used to reweight in accordance to the appearance of external parton lines. 
A ratio of two Sudakov form factors A(g, go)/A(g, go) accounts for the probability of 
having no emission resolvable at go during the evolution from Q to q. Hence, it is 
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employed for the reweighting according to internal parton lines. The lower limit is taken 
to be Qq = kj_Q OT Qo = D kj_Q for partons that are clustered to a beam or to another 
final state parton, respectively. 

The sequence of clusterings, stopped after the eventual identification of a 2 — > 2 
configuration (the core process), is used to reweight the matrix element. Moreover, this 
also gives a shower history, whereas the 2^2 core process defines the starting conditions 
for the vetoed shower. For the example of an identified pure QCD 2^2 core process, the 
four parton lines left as a result of the completed clustering will start their evolution at 
the corresponding hard scale. Subsequently, additional radiation is emitted from each leg 
by evolving under the constraint that any emission harder than the separation cut k±o is 
vetoed. The starting scale of each leg is given by the invariant mass of the mother parton 
belonging to the identified QCD splitting, through which the considered parton has been 
initially formed. 

Finally, it should be noted that the algorithm implemented in Sherpa does the merging 
of the sequence of processes pp ^ X + n jets with n = 0, 1, . . . , fully automatically - 
the user is not required to generate the samples separately and mix them by hand. 



2.2 The Dipole Cascade and CKKW 

The merging prescription developed for the dipole cascade in the ARIADNE program [16] 
is similar to CKKW, but differs in the way the shower history is constructed, and in 
the way the Sudakov form factors are calculated. Also, since the ARIADNE cascade is 
ordered in transverse momentum the treatment of starting scales is simplified. Before 
going into details of the merging prescription, it is useful to describe some details of the 
dipole cascade, since it is quite different from conventional parton showers. 

The dipole model [17, 18] as implemented in the ARIADNE program is based around 
iterating 2 — 3 partonic splittings instead of the usual 1 — 2 partonic splittings in 
a conventional parton shower. Gluon emission is modeled as coherent radiation from 
colour-anti-colour charged parton pairs. This has the advantage of eg. including first 
order corrections to the matrix elements for e~^e~ — > gg in a natural way and it also 
automatically includes the coherence effects modeled by angular ordering in conventional 
showers. The process of quark-ant i- quark production does not come in as naturally, but 
can be added [19]. The emissions in the dipole cascade are ordered according to an 
invariant transverse momentum defined as 

(1± = ' (6) 

■5l23 

where Sij is the squared invariant mass of parton i and j, with the emitted parton having 
index 2. 

When applied to hadronic collisions, the dipole model does not separate between 
initial- and final-state gluon radiation. Instead all gluon emissions are treated as com- 
ing from final-state dipoles [20,21]. To be able to extend the dipole model to hadron 
collisions, spatially extended coloured objects are introduced to model the hadron rem- 
nants. Dipoles involving hadron remnants are treated in a similar manner to the normal 
final-state dipoles. However, since the hadron remnant is considered to be an extended 
object, emissions with small wavelength are suppressed. This is modeled by only allowing 
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a fraction of the remnant to take part in the emission. The fraction that is resolved during 
the emission is given by 



where /i is the inverse size of the remnant and a is the dimensionahty. These are semi- 
classical parameters, which have no correspondence in conventional parton cascades, 
where instead a suppression is obtained by ratios of quark densities in the backward 
evolution. The main effect is that the dipole cascade allows for harder gluon emissions 
in the beam directions, enabling it to describe properly eg. forward jet rates measured at 
HERA (see eg. [22]). 

There are two additional forms of emissions, which need to be included in the case 
of hadronic collisions. One corresponds to an initial state g ^ qq [23]. This does not 
come in naturally in the dipole model, but is added by hand in a way similar to that of 
a conventional initial-state parton shower [23]. The other corresponds to the initial-state 
q —>■ gq (with the gluon entering into the hard sub-process), which could be added in a 
similar way, but this has not yet been implemented in ARIADNE. 

When implementing CKKW for the dipole cascade [6,24], the procedure is slightly dif- 
ferent from what has been described above. Rather than using the standard /c^-algorithm 
to cluster the state produced by the matrix-element generator, a complete set of inter- 
mediate partonic states. Si, and the corresponding emission scales, q±i are constructed, 
which correspond to a complete dipole shower history. Hence, for each state produced by 
the matrix-element generator, basically the question how would ARIADNE have generated 
this state is answered. Note, however, that this means that only coloured particles are 
clustered, which differs from eg. SherpA, where also the W and its decay products are 
involved in the clustering. 

The Sudakov form factors are then introduced using the Sudakov veto algorithm. The 
idea is that we want to reproduce the Sudakov form factors used in ARIADNE. This is 
done by performing a trial emission starting from each intermediate state Si with q±i as a 
starting scale. If the emitted parton has a qj_ higher than q±i+i the state is rejected. This 
correspond to keeping the state according to the no-emission probability in ARIADNE, 
which is exactly the Sudakov form factor. 

It should be noted that for initial-state showers, there are two alternative ways of 
defining the Sudakov form factor. The definition in eq. ([2]) is used in eg. HerwiG [25], 
while eg. PythiA [26,27] uses a form, which explicitly includes ratios of parton densities. 
Although formally equivalent to leading logarithmic accuracy, only the latter corresponds 
exactly to a no-emission probability, and this is the one generated by the Sudakov veto 
algorithm. This, however, also means that the constructed emissions in this case need 
not only be reweighted by the running as in the standard CKKW procedure above, 
but also with ratios of parton densities, which in the case of gluon emissions correspond 
to the suppression due to the extended remnants in eq. ([7]) as explained in more detail 
in [6], where the complete algorithm is presented. 

2.3 The MLM procedure 

The so-called MLM "matching" algorithm is described below. 




(7) 
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1. The first step is the generation of parton-level configurations for all final-state parton 
multiplicities n up to a given {W + N partons). They are defined by the following 
kinematical cuts: 

pPart ^ ^^in ^ |^^^^^| ^ ^^^^ ^ ^^^^ > j^,^ ^ (g) 

where p^^^ and r/part are the transverse momentum and pseudo-rapidity of the final- 
state partons, and AR^^ is their minimal separation in the [r], </>) plane. The param- 
eters p™™, ?7max and -Rmin are called generation parameters, and are the same for all 
n = l, N. 

2. The renormalization scale is set according to the CKKW prescription. The nec- 
essary tree branching structure is defined for each event, allowing however only 
for branchings, which are consistent with the colour structure of the event, which 
in Alpgen is extracted from the matrix-element calculation [28]. For a pair of 
final-state partons i and j, we use the /c_L-measure defined by 

d,, = ARlmm{pl„pl^), (9) 

where ARfj = Arjfj + A(j)fj, while for a pair of initial/final-state partons we have 

d,,=pl, (10) 

i.e. the p\ of the final-state one. 

3. The k±-vaAue at each vertex is used as a scale for the relative power of a^. The 
factorization scale for the parton densities is given by the hard scale of the process, 
Qo = f^'w'^P\.w- '^^y happen that the clustering process stops before the lowest- 
order configuration is reached. This is the case, e.g., for an event like uu — Wcsg. 
Flavour conservation allows only the gluon to be clustered, since uu — >■ Wcs is a 
LO process, first appearing at In such cases, the hard scale Qo is adopted 
for all powers of corresponding to the non-merged clusters. 

4. Events are then showered, using Pythia or HerwiG. The evolution for each parton 
starts at the scale determined by the default Pythia and HerwiG algorithms on 
the basis of the kinematics and colour connections of the event. The upper veto 
cutoff to the shower evolution is given by the hard scale of the process, Qq. After 
evolution, a jet cone algorithm is applied to the partons produced in the perturbative 
phase of the shower. Jets are defined by a cone size -Rcius, a minimum transverse 
energy Ef^ and a maximum pseudo-rapidity ?7max- These parameters are called 
matching parameters, and should be kept the same for all samples n = 0, 1, . . . , N. 
These jets provide the starting point for the matching procedure, described in the 
next bullet. In the default implementation, we take -Rcius = -Rmin, ''7max = ''7max 
and Ef^'^ = + max(5 GeV, 0.2 x but these can be varied as part of the 
systematics assessment. To ensure a complete coverage of phase space, however, it 
is necessary that i?cius > ^min, '7max < '7max and Ef"' > j9^'°. 
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5. Starting from the hardest parton, the jet, which is closest to it in (77, 0) is selected. 
If the distance between the parton and the jet centroid is smaller than 1.5 x -Rcius? 
we say that the parton and the jet match. The matched jet is removed from the 
list of jets, and the matching test for subsequent partons is performed. The event 
is fully matched if each parton matches to a jet. Events, which do not match, 
are rejected. A typical example is when two partons are so close that they cannot 
generate independent jets, and therefore cannot match. Another example is when 
a parton is too soft to generate its own jet, again failing matching. 

6. Events from the parton samples with n < N, which survive matching, are then 
required not to have extra jets. If they do, they are rejected, a suppression, which 
replaces the Sudakov reweighting used in the CKKW approach. This prevents the 
double counting of events, which will be present in, and more accurately described 
by, the n + 1 sample. In the case of n = A^, events with extra jets can be kept since 
they will not be generated by samples with higher n. Nevertheless, to avoid double 
counting, we require that their transverse momentum be smaller than that of the 
softest of the matched jets. 

When all the resulting samples from n = 0, . . . , N are combined, we obtain an inclusive 
H^+jets sample. The harder the threshold for the energy of the jets used in the matching, 
E'^^, the fewer the events rejected by the extra-jet veto (i.e. smaller Sudakov suppression), 
with a bigger role given to the shower approximation in the production of jets. Using lower 
thresholds would instead enhance the role of the matrix elements even at lower E±, and 
lead to larger Sudakov suppression, reducing the role played by the shower in generating 
jets. The matching/rejection algorithm ensures that these two components balance each 
other. This algorithm is encoded in the Alpgen generator [29,30], where evolution with 
both HerwiG and PythiA are enabled. However, in the framework of this study, the 
parton shower evolution has been performed by HerwiG. 

2.4 The MadEvent approach 

The approach used in MadGraph/MadEvent [31,32] is based on the MLM prescription, 
but uses a different jet algorithm for defining the scales in and for the jet matching. 
The phase-space separation between the different multi-jet processes is achieved using 
the /c^-measure as in Sherpa (eq. ([T]) with D = 1), while the Sudakov reweighting is 
performed by rejecting showered events that do not match to the parton-level jets, as 
in Alpgen. This approach allows more direct comparisons with Sherpa, including the 
effects of changing the fc^-cutoff scale. The details of the procedure are as follows. 

Matrix-element multi-parton events are produced using MadGraph/MadEvent ver- 
sion 4.1 [33], with a cutoff Q^in clustered k±. The multi-parton state from the matrix- 
element calculation is clustered according to the fc^-algorithm, but allowing only cluster- 
ings that are compatible with the Feynman diagrams of the process, which are provided 
to MadEvent by MadGraph. The factorization scale, i.e., the scale used in the parton 
densities, is taken to be the clustering momentum in the last 2^2 clustering (the "cen- 
tral process"), usually corresponding to the transverse mass, m^, of the W boson. The 
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fc^-scales of the QCD clustering nodes are used as scales in the calculation of the various 
powers of a^. 

As in the Alpgen procedure, no Sudakov reweighting is performed. Instead, the 
virtuality-ordered shower of Pythia 6.4 [34] is used to shower the event, with the starting 
scale of the shower set to the factorization scale. The showered (but not yet hadronized) 
event is then clustered to jets using the /cj_-algorithm with a jet measure cutoff Q^in > 
Q^^, and the matrix-element partons are matched to the resulting jets, in a way, which 
differs from the standard MLM procedure. A parton is considered to be matched to the 
closest jet if the jet measure (^(parton, jet) is smaller than the cutoff Qmin- Events where 
not all partons are matched to jets are rejected. For events with parton multiplicity 
smaller than the highest multiplicity, the number of jets must be equal to the number of 
partons. For events with the highest multiplicity, jets are reconstructed, and partons 
are considered to be matched if Q(parton, jet) < Q^^^°^, the smallest fc^-measure in the 
matrix-element event. This means that extra jets below Q^^^°^ are allowed, similarly to 
the Sherpa treatment. 

Note that also the standard MLM scheme with cone jets is implemented as an alter- 
native in MadEvent and its Pythia interface. 

2.5 Helac implementation of the MLM procedure 

In Helac [35, 36] we have implemented the MLM procedure as described above, see sec- 
tion 12.31 Helac generates events for all possible processes at hadron and lepton colliders 
within the Standard Model and has been successfully tested with up to 10 particles in the 
final state [36-38]. 

The partons from the matrix-element calculation are matched to the jets constructed 
after the parton showering. The parton-level events are generated with a minimum p^min 
threshold for the partons, p±j > p±mm, a minimum parton separation, ARjj > i?min, and a 
maximum pseudo-rapidity, |?7j| < ?7max- In order to extract the necessary information used 
by the /c^-reweighting, initial- and final-state partons are clustered backwards as described 
in section 12.31 where again the colour flow information extracted from the matrix-element 
calculation is used as a constraint on the allowed clusterings. The k±-measme, dij, for 
pairs of outgoing partons is given by equation and for pairs of partons where one is 
incoming and one is outgoing by equation (fTUj) . If two outgoing partons are clustered, 
i.e. dij is minimal, the resulting parton is again an outgoing parton with p = pi + pj 
and adjusted colour flow. In the case when incoming and outgoing partons are clustered, 
the new parton is incoming and its momentum is p = pj — pi. As a result we obtain 
a chain of d-values. For every node, a factor of a;s((inodc)/«s(<5o) multiplied into the 
weight of the event. For the unclustered vertices as well as for the scale used in the 
parton density functions, the hard scale of the process Ql = + p^yy is used. No 
Sudakov reweighting is applied. The sample of events output, which is in the latest Les 
Houches event file format [39], is read by the interface to PYTHIA version 6.4 [34], where 
the virtuality-ordered parton shower is constructed. For each event, a cone jet-algorithm 
is applied to all partons resulting from the shower evolution. The resulting jets are defined 
by -E'ilmin, ''7max ^ J^^ cone sizc -Rcius- The parton from the parton-level event is 

then associated to one of the constructed jets. Starting from the parton with the highest 
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p± we select the closest jet (1.5 x -Rcius) in the pseudo-rapidity/azimuthal- angle space. 
All subsequent partons are matched iteratively to jets. If this is impossible, the event is 
rejected. Additionally, for n < N, matched events with the number of jets greater than 
n are rejected, whereas for n = N, i.e. the highest multiphcity (in this study, iV = 4), 
events with extra jets are kept, only if they are softer than the N matched jets. This 
procedure provides the complete inclusive sample. 

3 General properties of the event generation for the 
study 

We present in the following sections some concrete examples. We concentrate on the case 
of VT+multi-jet production, which is one of the most studied final states because of its im- 
portant role as a background to top quark studies at the Tevatron. At the LHC, ly+jets, 
as well as the similar Z+jets processes, will provide the main irreducible backgrounds to 
signals such as multi-jet plus missing transverse energy, typical of Supersymmetry and of 
other manifestations of new physics. The understanding of W^-l-multi-jet production at 
the Tevatron is therefore an essential step towards the validation and tuning of the tools 
presented here, prior to their utilization at the LHC. 

The CDF and D0 experiments at the Tevatron collider have reported cross-section 
measurements for VT-l-multijet final states, both from Run 1 [40-43] and, in preliminary 
form, from Run 11 [44]. The Run 1 results typically refer to detector-level quantities, 
and a comparison with theoretical predictions requires to process the generated events 
through a detector simulation. These tests were performed in the context of the quoted 
analyses, using the LO calculations available at the time, showing a good agreemnt within 
the large statistical, systematic and theoretical uncertainties. The preliminary CDF result 
from Run 11 [44] is instead corrected for all detector effects, and expressed in terms of 
true jet energies. In this form it is therefore suitable for direct comparison with theory 
predictions. Measurements of Z+multijet rates are also crucial, but suffer from lower 
statistics w.r.t. the W case. A Run 11 measurement of jet pj_ spectra in Z-|-multijet 
events from D0 has been compared to the predictions of Sherpa in ref. [45], showing 
again a very good agreement. Preliminary CDF results on the spectra of the first and 
second jet in Z+jet events have been compared against parton-level NLO results [46]. 
For both the W and Z cases, the forthcoming analyses of the high-statistics sample now 
available at the Tevatron will provide valuable inputs for more quantitative analyses of 
the codes presented here. 

For each of the codes, we calculated a large set of observables, addressing inclusive 
properties of the events (transverse momentum spectrum of the W and of leading jets) 
as well as geometric correlations between the jets. What we present and discuss here is 
a subset of our studies, which illustrates the main features of the comparison between 
the different codes and of their own systematics. A preliminary account of these results, 
limited to the Alpgen, Ariadne and Sherpa codes, was presented in [2]. More complete 
studies of the systematics of each individual code have been [3-7] or will be presented 
elsewhere by the respective authors. 

The existence in each of the codes of parameters specifying the details of the merging 
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algorithms presents an opportunity to tune each code so as to best describe the data. This 
tuning should be seen as a prerequisite for a quantitative study of the overall theoretical 
systematics: after the tuning is performed on a given set of final states (e.g. the 1^+jets 
considered here), the systematics for other observables or for the extrapolation to the 
LHC can be obtained by comparing the difference in extrapolation between the various 
codes. Here it would be advantageous if future analysis of Tevatron data would provide 
us with spectra corrected for detector effects in a fashion suitable for a direct comparison 
against theoretical predictions. 

The following two sections present results for the Tevatron {pp collisions at 1.96 TeV) 
and for the LHC {pp at 14 TeV). The elements of the analysis common to all codes are 
the following: 

• Event samples. Tevatron results refer to the combination of and W~ bosons, 
while at the LHC only are considered. All codes have generated parton-level 
samples according to matrix elements with up to 4 final-state partons, i.e. = 4. 
Partons are restricted to the light-flavour sector and are taken to be massless. The 
Yukawa couplings of the quarks are neglected. The PDF set CTEQ6L has been 
used with as{mz) = 0.118. Further standard-model parameters used were: mw = 
80.419 GeV, Tw = 2.048 GeV, mz = 91.188 GeV, Tz = 2.446 GeV, the Fermi 
constant = 1.16639 ■ 10"^ GeV-^ sin^ Ow = 0.2222 and oem = 1/132.51. 

• Jet definitions. Jets were defined using Paige's GETJET cone-clustering algorithm, 
with a calorimeter segmentation of (A?7, A0) = (0.1,6°) extended over the range 
|?7| < 2.5 (|?7| < 5), and cone size of 0.7 (0.4) for the Tevatron (LHC). At the Tevatron 
(LHC) we require jets with E± > 10 (20) GeV, and pseudo-rapidity |?7| < 2 (4.5). 
For the analysis of the differential jet rates denoted as di, the Tevatron Run 11 k±- 
algorithm [15j^ was applied to all final-state particles fulfilling |?7| < 2.5 (5). The 
fc_L-measure used in the algorihtm is given by equations and ( ITOl) . 

In all cases, except the di plots, the analysis is done at the hadron level, but without 
including the underlying event. The di plots were done to check the details of the merging 
and are therefore done at parton level to avoid any smearing effects from hadronization. 
For all codes, the systematic uncertainties are investigated by varying the merging scale 
and by varying the scale in and, for some codes, in the parton density functions. For 
Alpgen and Helac, the scale in has been varied only in the ttg-reweighting of the 
matrix elements, while for the others the scale was also varied in the parton cascade. 
Note that varying the scale in the final-state parton showers will spoil the tuning done to 
LEP data for the cascades. A consistent way of testing the scale variations would require 
retuning of hadronization parameters. However, we do not expect a strong dependence on 
the hadronization parameters in the observables we consider, and no attempt to retune 
has been made. 

The parameter choices specific to the individual codes follows: 

• AlpGEN: The parton-level matrix elements were generated with AlpGEN [29, 30] 
and the subsequent evolution used the HerwiG parton shower according to the 

^More precisely, we used the implementation in the ktclus package [47] (IMDDE=5, or 4211). 
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MLM procedure. Version 6.510 of HerwiG was used, with its default shower and 
hadronization parameters. The default results for the Tevatron (LHC) were obtained 
using parton-level cuts (see eq.®) of p™'"^ = 8 (15) GeV, rjmax = 2.5 (5), -Rmin = 
0.7 (0.4) and matching defined by Ef"' = 10 (20) GeV, = r/^ax and i?cius = 
-Rmin- The variations used in the assessment of the systematics cover: 

— different thresholds for the definition of jets used in the matching: Ef_^^ = 20 
and 30 GeV for the Tevatron, and Ef"' = 30 and 40 GeV for the LHC. These 
thresholds were applied to the partonic samples produced with the default gen- 
eration cuts, as well as to partonic samples produced with higher p™™ values. 
No difference was observed in the results, aside from an obviously better gen- 
eration efficiency in the latter case. In the following studies of the systematics, 
the two threshold settings will be referred to as Alpgen parameter sets ALptX, 
where X labels the value of the threshold. Studies with different values of -Rcius 
and -Rmin were also performed, leading to marginal changes, which will not be 
documented here. 

— different renormalization scales at the vertices of the clustering tree: /i = yUo/2 
and fi = 2 fiQ, where /xq is the default fc^- value. In the following studies of the 
systematics, these two settings will be referred to as Alpgen parameter sets 
ALscL (for "Low") and ALscH (for "High"). 

The publicly available version V2.10 of the code was used to generate all the Alpgen 
results. 

• AriADNE: The parton-level matrix elements were generated with MAdEvent and 
the subsequent evolution used the dipole shower in ARIADNE according to the pro- 
cedure outlined in section 12.21 Hadronization was performed by Pythia. 

For the default results at the Tevatron (LHC) the parton-level cuts were p±mm = 
10 (20), Rjj < 0.5 (0.4) and, in addition, a cut on the maximum pseudo-rapidity 
of jets, ?7jmax = 2.5 (5.0). The variations used in the assessment of the systematics 
cover: 

— different values of the merging scales p±min = 20 and 30 GeV for the Tevatron 
(30 and 40 GeV for the LHC). In the following studies of the systematics, these 
two settings will be referred to as ARIADNE parameter sets ARptX. 

— a change of the soft suppression parameters in eq. ([7]) from the default values 
of yU = 0.6 GeV and a = 1, to /i = 0.6 GeV and a = 1.5 (taken from a tuning 
to HERA data [48]). This setting will be referred to as ARs. 

— different values of the scale in a^: /i = /io/2 and = 2 /iq were used (ARscL 
and ARscH). This scale change was used in as evaluations in the program. 

• HelAC: The parton-level matrix elements were generated with Helac [35,36] and 
the phase space generation is performed by Phegas [49] . The subsequent evolution 
used the default virtuality-ordered shower in Pythia 6.4 [34] according to the MLM 
procedure. Hadronization was performed by PYTHIA. 
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In the present study, e'^Ug + n jets and e~Ue + n jets samples with n = 0, . . . , 4 
have been generated for Tevatron, while for LHC predictions only e'^Ug + n jets final 
states have been considered. The number of subprocesses (i.e. ud e^u^uugg is 
one for the iy+ + 4 jets) in those cases is 4, 12, 94, 158 and 620 for n = 0, 1, 2, 3, 4 
respectively, with the number of quark flavours being 4/5 for the initial/final states. 

The default results for the Tevatron (LHC) were obtained using parton-level cuts 
of p±min = 8 (15) GeV, ?7max = 2.5 (5), -Rmin = 0.7 (0.4) and matching defined by 
^frnin = 10 (20) GeV, = 2 (4.5) and R'^l = 0.7 (0.4). The variations used in 
the assessment of the systematics cover: 

— different thresholds for the definition of jets used in the matching: -Exmin — 
30 GeV for the Tevatron, and Ef^-^^ = 40 GeV for the LHC. In the following 
studies of the systematics, these two settings will be referred to as Helac 
parameter sets HELptX, where X labels the value of the threshold. 

— different renormalization scales at the vertices of the clustering tree: yU = /io/2 
and fj, = 2 /io, where /xq is the default fc^-value. In the following studies of 
the systematics, these two settings will be referred to as Helac parameter sets 
HELscL and HELscH. 

• MadEvent: The parton-level matrix elements were generated with MadEvent and 
the subsequent evolution used the Pythia shower according to the modified MLM 
procedure in section 12. 4[ Hadronization was performed by Pythia. 

For the default results at the Tevatron (LHC) the value of the merging scale has 
been chosen to kj_Q = 10 (20) GeV. The variations used in the assessment of the 
systematics cover: 

— different values of the merging scale kj_Q = 20 and 30 GeV for the Tevatron, and 
k±Q = 30 and 40 GeV for the LHC. In the following studies of the systematics, 
these two settings will be referred to as MadEvent parameter sets MEktX. 

— different values of the scales used in the evaluation of as, in both the matrix 
element generation and the parton shower: n = /xo/2 and = 2/io, where /xq 
is the default /c^-value. These two settings will be referred to as MadEvent 
parameter sets MEscL and MEscH. 

• SherpA: The parton-level matrix elements used within Sherpa have been obtained 
from the internal matrix-element generator AMEGIC++ [50]. Parton showering has 
been conducted by APACIC++ [51,52] whereas the combination of the matrix el- 
ements with this parton shower has been accomplished according to the CKKW 
procedur^. The hadronization of the shower configurations has been performed by 
Pythia 6.214, which has been made available through an internal interface. 

For the default Tevatron (LHC) predictions, the value of the merging scale has been 
chosen to k±o = 10 (20) GeV. All Sherpa predictions for the Tevatron (LHC) have 

Beyond the comparison presented here, SHERPA predictions for M^+multi-jets have already been 
vahdated and studied for Tevatron and LHC energies in [3,4]. Results for the production of pairs of 
M^-bosons have been presented in [5] . 
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been obtained by setting the internally used D-parameter (cf . eq. ([T]) in section I2.ip 
through D = 0.7 (0.4). Note that, these two choices directly determine the gener- 
ation of the matrix elements in Sherpa. The variations used in the assessment of 
the systematics cover: 

— first, different choices of the merging scale kj_Q. Values of 20 and 30 GeV, and 
30 and 40 GeV have been used for the Tevatron and the LHC case, respectively. 
In the following studies of the systematics, these settings will be referred to 
as Sherpa parameter sets SHktX where X labels the value of the internal jet 
scale. 

— and, second, different values of the scales used in any evaluation of the as 
and the parton distribution function^. Two cases have been considered, fi = 
/io/2 and /i = 2 /xq. The choice of the merging scale is as in the default run, 
where /xq denotes the corresponding /c^-values. In the subsequent studies of the 
systematics these two cases are referred to as SHERPA parameter sets SHscL 
and SHscH. It should be stressed that these scale variations have been applied 
in a very comprehensive manner, i.e. in both the matrix-element and parton- 
showering phase of the event generation. 

All Sherpa results presented in this comparison have been obtained with the pub- 
licly available version 1.0.10. 

4 Tevatron Studies 

4.1 Event rates 

We present here the comparison among inclusive jet rates. These are shown in table [H 
For each code, in addition to the default numbers, we present the results of the various 
individual alternative choices used to assess the systematics uncertainty. In table [2] we 
show the "additional jet fractions", namely the rates a{W + n + 1 jets) /a{W + n jets), 
once again covering all systematic sets of all codes. Fig. [H finally, shows graphically 
the cross-section systematic ranges: for each multiplicity, we normalize the rates to the 
average of the default values of all the codes. 

It should be noted that the scale changes in all codes lead to the largest rate varia- 
tions. This is reflected in the growing size of the uncertainty with larger multiplicities, a 
consequence of the higher powers of a^. A more detailed discussion on the effects of the 
scale changes can be found in section El Furthermore we note that the systematic ranges 
of all codes have regions of overlap. 

4.2 Kinematical distributions 

We start by showing in fig. [2] the inclusive E± spectra of the leading 4 jets. The absolute 
rate predicted by each code is used, in units of pb/GeV. The relative differences with 

^For example, the analytical Sudakov form factors used in the matrix-element reweighting hence vary 
owing to their intrinsic a^-coupling dependence. 
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Table 1: Cross sections (in pb) for the inclusive jet rates at the Tevatron, according to 
default and alternative settings of the various codes. 
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Table 2: Cross-section ratios for {n + l)/ri inclusive jet rates at the Tevatron, according to the 
default and alternative settings of the various codes. 
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Figure 1: Range of variation for the Tevatron cross-section rates of the five codes, normahzed 
to the average value of the default settings for all codes in each multiplicity bin. 
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Figure 2: Inclusive E± spectra of the leading 4 jets at the Tevatron (pb/GeV). In all cases the 
full line gives the ALPGEN results, the dashed line gives the ARIADNE results and the "+", "x" 
and "o" points give the HELAC, MadEvent and SHERPA results, respectively. 
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Figure 3: Inclusive rj spectra of the 4 leading jets at the Tevatron. All curves are normalized 
to unit area. Lines and points are as in fig. [21 



respect to the Alpgen results, in this figure and all other figures of this section, are shown 
in the lower in-sets of each plot, where for the code X we plot the quantity (cr(X) — ctq) / o'o, 
(To being the values of the Alpgen curves. 

There is generally good agreement between the codes, except for ARIADNE, which has 
a harder E± spectra for the leading two jets. There we also find that Sherpa is slightly 
harder than Alpgen and Helac, while MAdEvent is slightly softer. 

Fig. [3 shows the inclusive t] spectra of the leading 4 jets, all normalized to unit area. 
There is a good agreement between the spectra of Alpgen, Helac and MadEvent, while 
Ariadne and Sherpa spectra appear to be broader, in particular for the sub-leading jets. 
This broadening is expected for ARIADNE since the gluon emissions there are essentially 
unordered in rapidity, which means that the Sudakov form factors applied to the matrix- 
element-generated states include also a log 1/x resummation absent in the other programs. 

Fig. UK shows the inclusive p± distribution of the W boson, with absolute normalization 
in pb/GeV. This distribution refiects in part the behaviour observed for the spectrum of 
the leading jet, with AriADNE harder than SherpA, which, in turn, is slightly harder 
than Alpgen, Helac and MadEvent. The region of low momenta, p±w < 50 GeV, 
is expanded in fig. |3)3. Fig. shows the rj distribution of the leading jet, rji, when its 
transverse momentum is larger than 50 GeV. The curves are absolutely normalized, so 
that it is clear how much rate is predicted by each code to survive this harder jet cut. 
The |?7| separation between the W and the leading jet of the event above 30 GeV is shown 
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Figure 4: (a) and (b) p± spectrum of W bosons at the Tevatron (pb/GeV). (c) Inclusive rj 
spectrum of the leading jet, for p^^^> 50 GeV; absolute normahzation (pb). (d) Pseudo-rapidity 
separation between the W and the leading jet, At] = \r]w — ^jctil, for '[^^^> 30 GeV, normalized 
to unit area. Lines and points are as in fig. [21 



in fig. mi, normalized to unit area. Here we find that AriADNE has a broader correlation, 
while HelAC and MadEvent are somewhat more narrow than AlpGEN and SherpA. 

In fig. [5] we show the merging scales di as obtained from the /c^-algorithm, where dt 
is the scale in an event where i jets are clustered into i — 1 jets. These are parton-level 
distributions and are especially sensitive to the behaviour of the merging procedure close 
to the merging/matching scale. Note that in the plots showing the difference the wiggles 
stem from both the individual codes and from the Alpgen reference. In section IH] below, 
the behaviour of the individual codes is treated separately. 

Also shown in fig. [5] is the separation in AR = y/ Arj'^ + between successive jet 
pairs ordered in hardness. The ARu is dominated by the transversal-plane back-to-back 
peak at AR12 = vr, while for larger AR in all cases the behaviour is more dictated by the 
correlations in pseudo-rapidity. For these larger values we find a weaker correlation in 
Ariadne and Sherpa, which can be expected from their broader rapidity distributions 
in fig. [31 

Finally, in fig. [6lwe show H±, the scalar sum of the transverse momenta of the charged 
lepton, the neutrino and the jets. This is a variable in which one often does experimental 
cuts in searches for new phenomena and is not expected to be very sensitive to the 
particulars in the merging schemes. The results show good agreement below 100 GeV, 
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Figure 5: (a)-(c) di {i = 1,2,3) spectra, where di is tlie scale in a parton-level event wliere i 
jets are clustered into i — \ jets using the fc^-algorithm. (d)-(f) Ai? separations at the Tevatron 
between jet 1 and 2, 2 and 3, and 3 and 4. All curves are normalized to unit area. Lines and 
points are as in fig. [21 




Figure 6: The scalar sum of the transverse momentum of the charged lepton, the neutrino and 
the jets at the Tevatron. Lines and points are as in fig. [2l 
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Figure 7: Range of variation for the LHC cross-section rates of the five codes, normahzed to 
the average value of the default settings for all codes in each multiplicity bin. 

but at higher values, as expected from the differences in the hardness of the jet and p±w 
spectra, ARIADNE has a harder spectra than Sherpa and Alpgen, while MadEvent and 
Helac has a slightly softer spectra. 

5 LHC Studies 

5.1 Event rates 

The tables (table [3] and H]) and figure (fig. E]) of this section parallel those shown earlier for 
the Tevatron. The largest rate variations is, similarly to the Tevatron rates, determined 
by the scale changes (described in more detail in section [U]). The main feature of the LHC 
results is the significantly larger rates predicted by ARIADNE (see also the discussion of 
its systematics, section [6^ . which are outside the systematics ranges of the other codes. 
Aside from this and the fact that Sherpa gives a smaller total cross section (see also 
the last part of the discussion of the SHERPA systematics in section [U7^ . the comparison 
among the other codes shows an excellent consistency, with a pattern of the details similar 
to what seen for the Tevatron. 

5.2 Kinematical distributions 

Following the same sequence of the Tevatron study, we start by showing in fig. [H] the 
inclusive E± spectra of the leading 4 jets. The absolute rate predicted by each code is 
used, in units of pb/GeV. 

Except for ARIADNE, we find good agreement among the codes, with ARIADNE having 
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Table 3: Cross sections (in pb) for the inclusive jet rates at the LHC, according to the default 
and alternative settings of the various codes. 
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Table 4: Cross-section ratios for [n + l)/n inclusive jet rates at the LHC, according to the 
default and alternative settings of the various codes. 
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Figure 8: Inclusive E± spectra of the leading 4 jets at the LHC (pb/GeV). In all cases the full 
line gives the Alpgen results, the dashed line gives the ARIADNE results and the "+", "x" and 
"o" points give the Helac, MadEvent and Sherpa results respectively. 

significantly harder leading jets, while for sub-leading jets the increased rates noted in 
fig. [7| mainly come from lower E±. Among the other codes, Helac and Sherpa have 
consistently somewhat harder jets than Alpgen, while MadEvent is a bit softer, but 
these differences are not as pronounced. 

For the pseudo-rapidity spectra of the jets in fig. it is clear that AriADNE has a much 
broader distribution in all cases. Also Sherpa has broader distributions, although not as 
pronounced, while the other codes are very consistent. 

The p± distribution of W'^ bosons in fig. follows the trend of the leading-jet E± 
spectra. The differences observed in the p±w region below 10 GeV are not due to the 
choice of merging approach, but are entirely driven by the choice of shower algorithm. 
Notice for example the similarity of the HELAC and MadEvent spectra, and their peaking 
at lower pt than the HerwiG spectrum built into the AlpGEN curve, a result well known 
from the comparison of the standard Pythia and HerwiG generators. Increasing the 
transverse momentum of the leading jet in fig. [TOb does not change the conclusions much 
for its pseudo-rapidity distribution. Also the rapidity correlation between the leading 
jet and the follows the trend found for the Tevatron, but the differences are larger, 
with a much weaker correlation for ARIADNE. Also SHERPA shows a somewhat weaker 
correlation, while HELAC is somewhat stronger than AlpGEN and MadEvent. 

For the distribution in clustering scale in fig. [11], we find again that ARIADNE is by far 
the hardest. The results given by the other codes are comparable, with the only exception 
that for the di distribution, SHERPA gives a somewhat harder prediction compared to the 
ones made by the MLM-based approaches. 
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Figure 10: (a) and (b) pj_ spectrum of bosons at the LHC (pb/GeV). (c) rj spectrum of 



the leading jet, for > 100 GeV; absolute normalization (pb). (d) Pseudo-rapidity separation 
between the and the leading jet, Ar] i ^ ieti 

area. Lines and points are as in fig. [8l 
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Figure 11: (a)-(c) di {i = 1,2,3) spectra, where di is the scale in a parton-level event where 
i jets are clustered into i — 1 jets using the fc_L-algorithm. (d)-(f) AR separations at the LHC 
between jet 1 and 2, 2 and 3, and 3 and 4. All curves are normalized to unit area. Lines and 
points are as in fig. El 
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Figure 12: The scalar sum of the transverse momentum of the charged lepton, the neutrino and 
the jets at the LHC. Lines and points are as in fig. [8j 
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The AR distributions, in fig. [TT], show at large separation a behaviour consistent with 
the broad rapidity distributions found for SherpA, and in particular for ARIADNE, in 
fig. [HI This increase at large AR is then compensated by a depletion with respect to the 
other codes at small separation. 

The scalar transverse momentum sum in fig. [12] shows significantly larger deviations as 
compared to the results for the Tevatron. ARIADNE has a much harder spectra than the 
other codes, while Sherpa and Helac are slightly harder than Alpgen and MadEvent 
is significantly softer. As in the Tevatron case, it is a direct reflection of the differences 
in the hardness of the jet and p±w spectra, although the increased phase space for jet 
production at the LHC makes the p±w contribution less important at high H_i values. 

6 Systematic studies 

In this section we present the systematic studies of each of the codes separately for both the 
Tevatron and the LHC, followed by some general comments on differences and similarities 
between the codes. 

In all cases we have chosen a subset of the plots shown in the previous sections: the 
transverse momentum of the W, the pseudo-rapidity of the leading jet, the separation 
between the leading and the sub-leading jet, and the di logarithmic spectra. As before, 
all spectra aside from p±w are normalized to unit integral over the displayed range. The 
variations of the inclusive jet cross sections has already been shown in table [TH and figs. [1] 
and [3 

To estimate what systematic error can be expected from each code, the effects of 
varying the merging scale and changing the scale used in the determination of the strong 
coupling is studied (the details for each code is described in section [3]). The merging scale 
variations are introduced according to the definition in each algorithms and should lead 
to small changes in the results, although the nonleading terms from the matrix elements 
always lead to some residual dependence on the merging scale. In the various algorithms 
different choices have been made regarding how to estimate the uncertainty from Os-scale 
variations and this leads to slightly different physical consequences. 

In the case of Alpgen and Helac, the scale changes are only implemented in the 
strong coupling calculated in the matrix element reweighting, but the scale in the shower 
remains unchanged. This leads to variations of the result that are proportional to the 
relevant power of used in the matrix element, which means that the spectra contains 
small deviations below the merging scale and that the deviations grow substantially above 
the merging scale. 

In Ariadne, MadEvent and Sherpa both the scale in the a^-reweighting and the 
scale in the of the shower is changed. In addition to this the scale used in the evaluation 
of the parton densities is also changed in SHERPA (this is discussed further in section 
Including the scale variations in ag in the shower changes the fraction of rejected events 
or the Sudakov form factors (depending on which algorithm is used), which modifies the 
cross section in the opposite direction compared to the scale changes in the matrix element 
reweighting. This leads to smaller deviations in the results above the merging scale and 
it is also possible to get significant deviations in the opposite direction below the merging 
scale, which is mainly visible in the p±w spectra. 
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Figure 13: Alpgen systematics at the Tevatron. (a) and (b) show the p± spectrum of the 
W, (c) shows the pseudo-rapidity distribution of the leading jet, (d) shows the Ai? separation 
between the two leading jets, and (e)-(g) show the di {i = 1,2,3) spectra, where di is the scale 
in a parton-level event where i jets are clustered into i — 1 jets using the A;^- algorithm. The full 
line is the default settings of Alpgen, the shaded area is the range between ALscL and ALscH, 
while the points represent ALpt20 and ALptSO as defined in section [3l 



28 




Figure 14: Alpgen systematics at the LHC. (a) and (b) show the p± spectrum of the W, (c) 
shows the pseudo-rapidity distribution of the leading jet, (d) shows the AR separation between 
the two leading jets, and (e)-(g) show the di {i = 1,2,3) spectra, where di is the scale in a 
parton-level event where i jets are clustered into i — 1 jets using the /c_L-algorithm. The full 
line is the default settings of Alpgen, the shaded area is the range between ALscL and ALscH, 
while the points represent ALptSO and ALpt40 as defined in section [3l 
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6.1 Alpgen systematics 

The Alpgen distributions for the Tevatron are shown in fig. [T31 The pattern of variations 
is consistent with the expectations. In the case of the p±w spectra, which are plotted in 
absolute scales, the larger variations are due to the change of scale, with the lower scale 
leading to a harder spectrum. The ±20% effect is consistent with the scale variation of 
as, which dominates the scale variation of the rate once p±w is larger than the Sudakov 
region. The change of matching scales only leads to a minor change in the region GeV < 
P±w < 40 GeV, confirming the stability of the merging prescription. 

In the case of the rapidity spectrum, we notice that the scale change leaves the shape 
of the distribution unaltered, while small changes appear at the edges of the rj range. The 
di distributions show agreement among the various options when ^/di < 10 GeV. This is 
due to the fact that the region y/dl < 10 GeV is dominated by the initial-state evolution 
of an n = i — 1 parton event, and both the matching and scale sensitivities are reduced. 
Notice that in the Alpgen prescription the scale for the shower evolution is kept fixed 
when the renormalization scale of the matrix elements is changed, as a way of exploring 
the impact of a possible mismatch between the two. 

For > Ef^^ the jet transverse energies are themselves typically above Ef^^, and 
the sensitivity to matching thresholds smaller than Ef^^ is reduced, since if the event 
matched at E'^'^'^, it will also match below that. Here the main source of systematics is 
therefore the scale variation, associated to the hard matrix element calculation for the 
n = i jet multiplicity. The region 10 < < E'^^ is the transition region between the 
dominance of the shower and of the matrix element description of hard radiation. The 
structure observed in the di distributions in this region refiects the fact that shower and 
matrix element emit radiation with a slightly different probability. The selection of a 
matching threshold, which leads to effects at the level of ±20% and is therefore consistent 
with a LL accuracy and can be used to tune to data. 

For the LHC, the Alpgen systematics is shown in fig. [T31 The comparison of the 
various parameter choices is similar to what we encountered at the Tevatron, with vari- 
ations in the range of ±20% for the matching-scale systematics, and up to 40% for the 
scale systematics. The pattern of the glitches in the di spectra for the different matching 
thresholds is also consistent with the explanation provided in the case of the Tevatron. 

6.2 Ariadne systematics 

The Ariadne systematics for the Tevatron is shown in fig. [151 Since the dipole cascade 
by itself already includes a matrix-element correction for the first emission, we see no 
dependence on the merging scale in the p±_w, Vjeti and di distributions, which are mainly 
sensitive to leading order corrections. For the other distributions, we become sensitive 
to higher-order corrections, and here the pure dipole cascade underestimates the matrix 
element and also tends to make the leading jets less back-to-back in azimuth. The first 
effect is expected for all parton showers, but is somewhat enhanced in ARIADNE due to 
the missing initial-state q ^ gq splitting, and is mostly visible in the ^2 distribution just 
below the merging scale. The second effect is clearly visible in the Ai?i2 distribution, 
which is dominated by low E±_ jets. 

The changing of the soft suppression parameter in ARs has the effect of reducing the 
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Figure 15: Ariadne systematics at the Tevatron. The plots are the same as in fig. [131 The 
full line is the default settings of ARIADNE, the shaded area is the range between ARscL and 
ARscH, while the points represent ARpt20, ARptSO and ARs as defined in section [3l 
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Figure 16: Ariadne systematics at the LHC. The plots are the same as in fig. (TH The full hne 
is the default settings of ARIADNE, the shaded area is the range between ARscL and ARscH, 
while the points represent ARptSO, ARpt40 and ARs as defined in section [3l 
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available phase space of gluon radiation, especially for large E± and in the beam directions, 
an effect, which is mostly visible for the hardest emission and in the p±w distribution. As 
for Alpgen, and also for the other codes, the change in scale mainly affects the hardness 
of the jets, but not the ?7jeti and the AR12 distribution. 

For the LHC, the ARIADNE systematics is shown in fig. [161 Qualitatively we find the 
same effects as in the Tevatron case. In particular we note the strong dependence on the 
soft suppression parameters in ARs, and it is clear that these have to be adjusted to fit 
Tevatron (and HERA) data before any predictions for the LHC can be made. It should 
be noted, however, that while eg. the high p±w tail in fig. [T6k for ARs is shifted down to 
be comparable to the other codes (cf. fig. fTOk). the medium p±w values are less affected 
and here the differences compared to the other codes can be expected to remain after a 
retuning. 

This difference is mainly due to the fact that the dipole cascade in ARIADNE, contrary 
to the other parton showers, is not based on standard DGLAP evolution, but also allows 
for evolution, which is unordered in transverse momentum a la BFKlH. This means that 
in Ariadne there is also a resummation of logs of 1/x besides the standard log resum- 
mation. This should not be a large effect at the Tevatron, and the differences there can 
be tuned away by changing the soft suppression parameters in Ariadne. However, at the 
LHC we have quite small values, x ~ miy/y/S < 0.01, which allow for a much increased 
phase space for jets as compared to what is allowed by standard DGLAP evolution. As 
a result one obtains larger inclusive jet rates as documented in table [3l The same effect 
is found in DIS at HERA, where x is even smaller as are the typical scales, Q^. And 
here, all DGLAP-based parton showers fail to reproduce final-state properties, especially 
forward jet rates, while ARIADNE does a fairly good job. 

It would be interesting to compare the merging schemes presented here also to HERA 
data to see if the DGLAP based shower would better reproduce data when merged with 
higher-order matrix elements. This would also put the extrapolations to the LHC on safer 
grounds. However, so far there exists one preliminary such study for the Ariadne case 
only [53]. 

6.3 Helac systematics 

The Tevatron HelAC distributions are shown in fig. [T71 Since HelAC results presented in 
this study are based on the MLM matching prescription, we expect the Helac systematics 
to follow at least qualitatively the Alpgen ones and this is indeed the case. On the other 
hand the use by HELAC of PythiA, for parton showering as well as for hadronization, leads 
to differences compared to the Alpgen results, where HerwiG is used. For the absolute 
rates, especially in the multi-jet regime, HELAC seems to be closer to MadEvent that 
also uses Pythia. 

For the LHC, the HELAC systematics are shown in fig. [THl The systematics follows 
a similar pattern compared to that already discussed for the Tevatron case, with the 
expected increase of up to 40% from scale variations, due to the higher collision energy. 

^Thc dipole emission of gluons in ARIADNE are ordered in transverse momentum, but not in rapidity. 
Translated into a conventional initial-state evolution, this corresponds to emissions ordered in rapidity 
but unordered in transverse momentum. 
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Figure 17: Helac systematics at the Tevatron. The plots are the same as in fig. [T3j The full 
line is the default settings of Helac, the shaded area is the range between HELscL and HELscH, 
while the points represent HELptSO as defined in section [3l 
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Figure 18: Helac systematics at the LHC. The plots are the same as in fig. [TH The full line 
is the default settings of HELAC, the shaded area is the range between HELscL and HELscH, 
while the points represent HELpt40 as defined in section [3l 
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Figure 19: MadEvent systematics at the Tevatron. The plots are the same as in fig. [T31 The 
full line is the default settings of MadEvent, the shaded area is the range between MEscL and 
MEscH, while the points represent MEkt20 and MEktSO as defined in section [31 
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Figure 20: MADE vent systematics at the LHC. The plots are the same as in fig. [HI The full 
line is the default settings of MadEvent, the shaded area is the range between MEscL and 
MEscH, while the points represent MEktSO and MEkt40 as defined in section [3l 
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6.4 MadEvent systematics 

The MadEvent distributions for the Tevatron are shown in fig. (THl Also here, the varia- 
tions are consistent with the expectations. For the p±w spectrum, the dominant variations 
are due to the change of scale for as, with the lower scale leading to a harder spectrum. 
Below the k±-cutoS, where the distribution is determined by the parton shower only, the 
lower scale gives the lower differential cross section. 

At Tevatron energies, both the p±w spectrum and the di spectra are relatively sta- 
ble with respect to variations of the matching scale. For the di spectra, the variation 
in matching scale gives a dip in the region 10 GeV < y/di < k±o, but is reduced for 
larger d^. The rapidity and jet-distance spectra show a remarkable stability under both 
renormalization-scale changes and variations in the cutoff scale. 

For the LHC, the systematics of the MadEvent implementation are shown in fig. [201 
The variations in renormalization scale give a very similar effect as for the Tevatron, with 
variations up to ±20% on the p± and di spectra. For variations in the matching scale 
k±o, however, the pattern is slightly different. This can be most easily understood from 
looking at the di spectra, since, as in the Sherpa case, the cutoff scale is defined to be just 
the di, so the transition between the parton-shower and matrix-element regions is very 
sharp. It is clear from these distributions that the default parton shower of Pythia does 
not reproduce the shape of the matrix elements at LHC energies even for relatively small 
k±, but falls off more sharply. There is therefore a dip in all the distributions around 
log/cxo; which gets more pronounced for the higher multiplicity distributions, and hence 
gives lower overall jet rates. The p±w distributions, as well as the di distributions, are 
composed of all the different jet-multiplicity samples, which gives systematically reduced 
hardness of the differential cross sections for increased cutoff scales. These effects are 
clearly visible also in Sherpa, which uses a PYTHIA-Iike parton shower and k± as merging 
scale. 

6.5 Sherpa systematics 

The systematics of the CKKW algorithm as implemented in SHERPA is presented in 
fig. EH for the Tevatron case. The effect of varying the scales in the PDF and strong 
coupling evaluations by a factor of 0.5 (2.0) is that for the lower (higher) scale choice, 
the VT-boson's p± spectrum becomes harder (softer). For this kind of observables the 
uncertainties given by scale variations dominate the ones emerging through variations 
of the internal separation cut. This is mainly due to a reduced (enhanced) suppression 
of hard-jet radiation through the as rejection weights. The differential jet rates, di^2,3, 
shown in fig. [2Tb-g. have a more pronounced sensitivity on the choice of the merging 
scale, leading to variations at the 20% level. In the CKKW approach this dependence can 
be understood since the /c^-measure intrinsically serves as the discriminator to separate 
the matrix-element and parton-shower regimes. Hence, the largest deviations from the 
default typically appear at di ~ k±o. However, the results are remarkably smooth, which 
leads to the conclusion that the cancellation of the dominant logarithmic dependence 
on the merging cut is well achieved. Moreover, considering the pseudo-rapidity of the 
leading jet and the cone separation of the two hardest jets, these distributions show a 
very stable behaviour under the studied variations, since they are indirectly infiuenced by 
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Figure 21: Sherpa systematics at the Tevatron. The plots are the same as in fig. [131 The full 
line is the default settings of Sherpa, the shaded area is the range between SHscL and SHscH, 
while the points represent SHkt20 and SHktSO as defined in section [3l 
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Figure 22: Sherpa systematics at the LHC. The plots are the same as in fig. (TH The full hne 
is the default settings of SHERPA, the shaded area is the range between SHscL and SHscH, while 
the points represent SHktSO and SHkt40 as defined in section [3l 
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the cut scale only. The somewhat more pronounced deviation at low Ai?i2 is connected to 
phase-space regions of jets becoming close together, which is affected by the choice of the 
merging scale and therefore by its variation. Taken together, Sherpa produces consistent 
results with relative differences of the order of or less than 20% at Tevatron energies. 

The Sherpa studies of systematics for the LHC are displayed in fig. [221 Compared 
to the Tevatron similar pattern of variations is recognized. The p± spectra of 

the boson show deviations under cut and scale variations that remain on the same 
order of magnitude. However, a noticeable difference is an enhancement of uncertainties 
in the predictions for low p±. This phase-space region is clearly dominated by the parton 
shower evolution, which in the SHERPA treatment of estimating uncertainties undergoes 
scale variations in the same manner as the matrix-element part. Therefore, the estimated 
deviations from the default given for low p± are very reasonable and reflect intrinsic 
uncertainties underlying the parton showering. For the LHC case, the effect is larger, 
since the evolution is dictated by steeply rising parton densities at x-values that are 
lower compared to the Tevatron scenario. The pseudo-rapidity of the leading jet and the 
cone separation of the two hardest jets show again a stable behaviour under the applied 
variations, the only slight exception is the regions of high |?7jeti| where, using a high k±- 
cut, the deviations are at the 20% level. The effect of varying the scales in the parton 
distributions and strong couplings now dominates the uncertainties in the differential jet 
rates, di^2,3, which are presented in fig. [22b-g. This time, owing to the larger phase space, 
for the low scale choice, /i = /io/2, the spectra become up to 40% harder, whereas, for the 
high scale choice, the spectra are up to 20% softer. The variation of the internal merging 
scale does not induce jumps around the cut region, however it has to be noted that for 
higher choices, e.g. k±o = 40 GeV, there is a tendency to predict softer distributions in 
the tails compared to the default. To summarize, the extrapolation from Tevatron to 
LHC energies does not yield significant changes in the predictions of uncertainties under 
merging-cut and scale variations; for the LHC scenario, they have to be estimated slightly 
larger, ranging up to 40%. The results are again consistent and exhibit a well controlled 
behaviour when applying the CKKW approach implemented in SHERPA at LHC energies. 

Giving a conservative, more reliable estimate, in SHERPA the strategy of varying the 
scales in the strong coupling together with the scales in the parton densities has been 
chosen to assess its systematics. So, to better estimate the impact of the additional 
scale variation in the parton density functions, renormalizat ion- scale variations on its 
own have been studied as well. Their results show smaller deviations wrt. the default 
in the observables of this study with the interpretation of potentially underestimating 
the systematics of the merging approach. Also, then the total cross sections vary less 
and become 9095 pb and 8597 pb for the low- and high-scale choice, respectively. Note 
that, owing to the missing simultaneous factorization-scale variation, their order is now 
reversed compared to SHscL and SHscH, whose values are given in table [31 Moreover, by 
referring to table[llthe cross-section ratios for e.g. a^-^^ / a'*"*' now read 0.26 and 0.22 for the 
low- and high-scale choice, respectively. This once more emphasizes that the approach's 
uncertainty may be underestimated when relying on a^-scale variations only. From table 
[3l it also can be noted that the total inclusive cross section given by the full high-scale 
prediction SHscH is - unlike Sherpa's default - close to the Alpgen default. In contrast 
to the MLM-based approaches, which prefer the factorization scale in the matrix-element 
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evaluation set through the transverse mass of the weak boson, the Sherpa approach makes 
the choice of employing the merging scale k±Q instead. This has been motivated in [9] 
and further discussed in [3]. Eventually, it is a good result that compatibility is achieved 
under this additional PDF-scale variation for the total inclusive cross sections, however 
it also clearly stresses that there is a non-negligible residual dependence on the choice of 
the factorization scale in the merging approaches. 

6.6 Summary of the systematics studies 

Starting with the p±w spectra, we find a trivial 20 — 40% effect of the scale changes, with 
the lower scale leading to a harder spectrum. In the case of Alpgen and Helac, this 
only affects the spectrum above the matching scale, while for ARIADNE, MadEvent and 
Sherpa there is also an effect below, as there the scale change is also implemented in 
the parton shower. For all the codes the change in merging/matching scale gives effects 
smaller than or of the order of the change in as scale. For ARIADNE, the change in the 
soft suppression parameter (ARs) gives a softer spectrum, which is expected as it directly 
reduces the phase space for emitted gluons. 

In the ?7jeti and Ai?i2 distributions the effects of changing the scale in are negligi- 
ble. In all cases, changing the merging/matching scale also has negligible effects on the 
rapidity spectrum, while the Ai?i2 tends to become more peaked at small values for larger 
merging/matching scales, and also slightly less peaked at Ai?i2 = tt. This effect is largest 
for Ariadne while almost absent for Helac. 

Finally for the di distributions we clearly see wiggles of varying sizes introduced by 
changing the merging scales. 

7 Conclusions 

This document summarizes our comparisons of five independent approaches to the prob- 
lem of merging matrix elements and parton showers. The codes under study, Alpgen, 
Ariadne, Helac, MadEvent and Sherpa, differ in which matrix-element generator is 
used, which merging scheme (CKKW or MLM) is used and the details in the implemen- 
tation of these schemes, as well as in which parton shower is used. 

We find that, while the three approaches (CKKW, L, and MLM) aim at a simula- 
tion based on the same idea, namely describing jet production and evolution by matrix 
elements and the parton shower, respectively, the corresponding algorithms are quite dif- 
ferent. The main differences can be found in the way in which the combination of Sudakov 
reweighting of the matrix elements interacts with the vetoing of unwanted jet production 
inside the parton shower. This makes it very hard to compare those approaches analyt- 
ically and to formalise the respective level of their logarithmic dependence. In addition, 
the different showering schemes used by the different methods blur the picture further. 
For instance virtuality ordering with explicit angular vetoes is used in SHERPA as well as 
in the HELAC and MadEvent approach employ Pythia to do the showering, ordering 
is the characteristic feature of ARIADNE, and, through its usage of HerwiG it is angular 
ordering that enters into the AlpGEN merging approach. However, although the formal 
level of agreement between the codes is not worked out in this publication and remains 
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unclear, the results presented in this publication show a reasonably good agreement. This 
proves that the variety of methods for merging matrix elements and parton showers can 
be employed with some confidence in vector boson plus jet production. 

The comparison also points to differences, in absolute rates as well as in the shape 
of individual distributions, which underscore the existence of an underlying systematic 
uncertainty. Most of these differences are at a level that can be expected from merging 
tree-level matrix elements with leading-log parton showers, in the sense that they are 
smaller than, or of the order of, differences found by making a standard change of scale in 
Os- In most cases the differences within each code are as large as the differences between 
the codes. And as the systematics at the Tevatron is similar to that at the LHC, it is 
conceivable that all the codes can be tuned to Tevatron data to give consistent predictions 
for the LHC. To carry out such tunings, we look forward to the publication by CDF and 
D0 of the measured cross sections for distributions such as those considered in this paper, 
fully corrected for all detector effects. 
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